Search for: All records

Creators/Authors contains: "Pleiss, Geoff"

Note: Clicking a Digital Object Identifier (DOI) link takes you to an external site maintained by the publisher. Some full-text articles may not be available free of charge during the publisher's embargo period.

Some links on this page may take you to non-federal websites, whose policies may differ from this site's.

  1. High-dimensional Bayesian optimization (BO) tasks such as molecular design often require > 10,000 function evaluations before obtaining meaningful results. While methods like sparse variational Gaussian processes (SVGPs) reduce computational requirements in these settings, the underlying approximations result in suboptimal data acquisitions that slow the progress of optimization. In this paper we modify SVGPs to better align with the goals of BO: targeting informed data acquisition rather than global posterior fidelity. Using the framework of utility-calibrated variational inference, we unify GP approximation and data acquisition into a joint optimization problem, thereby ensuring optimal decisions under a limited computational budget. Our approach can be used with any decision-theoretic acquisition function and is compatible with trust region methods like TuRBO. We derive efficient joint objectives for the expected improvement and knowledge gradient acquisition functions in both the standard and batch BO settings. Our approach outperforms standard SVGPs on high-dimensional benchmark tasks in control and molecular design. 
    Free, publicly-accessible full text available December 9, 2025
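A minimal sketch of the unified-objective idea described in the abstract above, using standard GPyTorch SVGP components: a single loss that adds a log expected-improvement utility term to the ELBO and is optimized over the variational parameters and the candidate point together. This is an illustrative simplification under my own assumptions (the `calibration_weight` trade-off and the `joint_step` helper are hypothetical), not the paper's utility-calibrated derivation.

```python
import torch
import gpytorch


class SVGP(gpytorch.models.ApproximateGP):
    """Standard sparse variational GP with learnable inducing points."""

    def __init__(self, inducing_points):
        var_dist = gpytorch.variational.CholeskyVariationalDistribution(inducing_points.size(0))
        strategy = gpytorch.variational.VariationalStrategy(
            self, inducing_points, var_dist, learn_inducing_locations=True
        )
        super().__init__(strategy)
        self.mean_module = gpytorch.means.ConstantMean()
        self.covar_module = gpytorch.kernels.ScaleKernel(gpytorch.kernels.RBFKernel())

    def forward(self, x):
        return gpytorch.distributions.MultivariateNormal(self.mean_module(x), self.covar_module(x))


def expected_improvement(mvn, best_f):
    # Closed-form EI of a Gaussian posterior, evaluated per candidate point.
    mu, sigma = mvn.mean, mvn.variance.clamp_min(1e-12).sqrt()
    z = (mu - best_f) / sigma
    normal = torch.distributions.Normal(0.0, 1.0)
    return sigma * (z * normal.cdf(z) + normal.log_prob(z).exp())


def joint_step(model, mll, x_train, y_train, x_cand, optimizer, calibration_weight=1.0):
    # One gradient step on a joint objective: ELBO (posterior fidelity) plus a
    # utility term (acquisition quality), over variational parameters AND x_cand.
    optimizer.zero_grad()
    elbo = mll(model(x_train), y_train)
    ei = expected_improvement(model(x_cand), best_f=y_train.max())
    loss = -(elbo + calibration_weight * ei.clamp_min(1e-12).log().sum())
    loss.backward()
    optimizer.step()
    return loss.item()


# Typical setup (assumed, not taken from the paper):
# likelihood = gpytorch.likelihoods.GaussianLikelihood()
# mll = gpytorch.mlls.VariationalELBO(likelihood, model, num_data=x_train.size(0))
# x_cand = torch.randn(q, d, requires_grad=True)
# optimizer = torch.optim.Adam(
#     list(model.parameters()) + list(likelihood.parameters()) + [x_cand], lr=0.01)
```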
  2. Free, publicly-accessible full text available April 28, 2026
  3. Training and inference in Gaussian processes (GPs) require solving linear systems with n × n kernel matrices. To address the prohibitive O(n³) time complexity, recent work has employed fast iterative methods, like conjugate gradients (CG). However, as datasets increase in magnitude, the kernel matrices become increasingly ill-conditioned and still require O(n²) space without partitioning. Thus, while CG increases the size of datasets GPs can be trained on, modern datasets reach scales beyond its applicability. In this work, we propose an iterative method which only accesses subblocks of the kernel matrix, effectively enabling mini-batching. Our algorithm, based on alternating projection, has O(n) per-iteration time and space complexity, solving many of the practical challenges of scaling GPs to very large datasets. Theoretically, we prove the method enjoys linear convergence. Empirically, we demonstrate its fast convergence in practice and robustness to ill-conditioning. On large-scale benchmark datasets with up to four million data points, our approach accelerates GP training and inference by speed-up factors of up to 27× and 72×, respectively, compared to CG. 
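A rough sketch of the sub-block idea in the abstract above: a randomized block (Gauss-Seidel-style) update for the regularized kernel system (K + σ²I)α = y that only ever forms a block_size × n slice of K, giving O(n) work and memory per iteration for a fixed block size. It conveys the mini-batching flavor only; the paper's alternating-projection algorithm and its convergence guarantees are not reproduced here, and `rbf_kernel`, `block_size`, and the fixed iteration count are assumptions.

```python
import numpy as np


def rbf_kernel(X1, X2, lengthscale=1.0):
    """Squared-exponential kernel matrix between two sets of points."""
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale**2)


def block_solve(X, y, noise=1e-2, block_size=256, iters=200, seed=0):
    """Approximately solve (K + noise*I) alpha = y with randomized block updates.

    Each iteration forms only a block_size x n slice of the kernel matrix,
    solves the small block system exactly, and updates alpha on that block.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    alpha = np.zeros(n)
    for _ in range(iters):
        idx = rng.choice(n, size=min(block_size, n), replace=False)
        K_Bn = rbf_kernel(X[idx], X)                        # block_size x n slice
        # residual restricted to the block: r_B = y_B - (K + noise*I)[B, :] @ alpha
        r_B = y[idx] - (K_Bn @ alpha + noise * alpha[idx])
        # exact solve on the small block, then update alpha on that block
        K_BB = K_Bn[:, idx] + noise * np.eye(len(idx))
        alpha[idx] += np.linalg.solve(K_BB, r_B)
    return alpha


# Example usage (toy scale):
# X = np.random.default_rng(1).standard_normal((5000, 8))
# y = np.sin(X.sum(axis=1))
# alpha = block_solve(X, y)
# predictions at X_test: rbf_kernel(X_test, X) @ alpha
```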
  4. Many areas of machine learning and science involve large linear algebra problems, such as eigendecompositions, solving linear systems, computing matrix exponentials, and trace estimation. The matrices involved often have Kronecker, convolutional, block diagonal, sum, or product structure. In this paper, we propose a simple but general framework for large-scale linear algebra problems in machine learning, named CoLA (Compositional Linear Algebra). By combining a linear operator abstraction with compositional dispatch rules, CoLA automatically constructs memory and runtime efficient numerical algorithms. Moreover, CoLA provides memory efficient automatic differentiation, low precision computation, and GPU acceleration in both JAX and PyTorch, while also accommodating new objects, operations, and rules in downstream packages via multiple dispatch. CoLA can accelerate many algebraic operations, while making it easy to prototype matrix structures and algorithms, providing an appealing drop-in tool for virtually any computational effort that requires linear algebra. We showcase its efficacy across a broad range of applications, including partial differential equations, Gaussian processes, equivariant model construction, and unsupervised learning. 
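The snippet below does not use CoLA's actual API; it is a small SciPy-based illustration of the underlying principle the abstract describes: expose a structured matrix, here a Kronecker product, as a matrix-free linear operator so that iterative routines exploit the structure and the full dense matrix is never materialized. The sizes and SPD construction are arbitrary choices for the demo.

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, cg


def kron_operator(A, B):
    """Matrix-free operator for K = A ⊗ B.

    Uses the row-major identity (A ⊗ B) vec(X) = vec(A X B^T), so a matvec
    costs O(mn(m+n)) instead of forming the (mn x mn) Kronecker matrix.
    """
    m, n = A.shape[0], B.shape[0]

    def matvec(x):
        X = x.reshape(m, n)
        return (A @ X @ B.T).reshape(-1)

    return LinearOperator((m * n, m * n), matvec=matvec)


rng = np.random.default_rng(0)
A = rng.standard_normal((50, 50)); A = A @ A.T + 50 * np.eye(50)  # SPD factor
B = rng.standard_normal((40, 40)); B = B @ B.T + 40 * np.eye(40)  # SPD factor
b = rng.standard_normal(50 * 40)

x, info = cg(kron_operator(A, B), b)           # solve (A ⊗ B) x = b via matvecs only
assert info == 0
print(np.linalg.norm(np.kron(A, B) @ x - b))   # dense check, for illustration only
```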
  5. The information content of crystalline materials becomes astronomical when collective electronic behavior and their fluctuations are taken into account. In the past decade, improvements in source brightness and detector technology at modern X-ray facilities have allowed a dramatically increased fraction of this information to be captured. Now, the primary challenge is to understand and discover scientific principles from big datasets when a comprehensive analysis is beyond human reach. We report the development of an unsupervised machine learning approach, X-ray diffraction (XRD) temperature clustering (X-TEC), that can automatically extract charge density wave order parameters and detect intra-unit-cell ordering and its fluctuations from a series of high-volume X-ray diffraction measurements taken at multiple temperatures. We benchmark X-TEC with diffraction data on a quasi-skutterudite family of materials, (CaₓSr₁₋ₓ)₃Rh₄Sn₁₃, where a quantum critical point is observed as a function of Ca concentration. We apply X-TEC to XRD data on the pyrochlore metal Cd₂Re₂O₇ to investigate its two much-debated structural phase transitions and uncover the Goldstone mode accompanying them. We demonstrate how unprecedented atomic-scale knowledge can be gained when human researchers connect the X-TEC results to physical principles. Specifically, we extract from the X-TEC–revealed selection rules that the Cd and Re displacements are approximately equal in amplitude but out of phase. This discovery reveals a previously unknown involvement of 5d² Re, supporting the idea of an electronic origin to the structural order. Our approach can radically transform XRD experiments by allowing in operando data analysis and enabling researchers to refine experiments by discovering interesting regions of phase space on the fly. 
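As a loose illustration of the "temperature clustering" step (not the released X-TEC code), one can describe each reciprocal-space pixel by its intensity-versus-temperature trajectory, normalize the trajectories, and cluster them so that order-parameter-like pixels, whose intensity switches on below a transition, separate from temperature-independent background. The log/z-score preprocessing and the Gaussian-mixture choice here are assumptions.

```python
import numpy as np
from sklearn.mixture import GaussianMixture


def temperature_cluster(intensity, n_clusters=2):
    """Cluster reciprocal-space pixels by the shape of their temperature trajectory.

    intensity: array of shape (n_pixels, n_temperatures).
    Returns per-pixel cluster labels and the mean trajectory of each cluster.
    """
    # Log-scale and z-score each trajectory so clusters reflect shape, not magnitude.
    traj = np.log(np.clip(intensity, 1e-12, None))
    traj = (traj - traj.mean(axis=1, keepdims=True)) / (traj.std(axis=1, keepdims=True) + 1e-12)
    gmm = GaussianMixture(n_components=n_clusters, covariance_type="full", random_state=0)
    labels = gmm.fit_predict(traj)
    return labels, gmm.means_  # cluster means ~ characteristic temperature trajectories
```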